Search CORE

19 research outputs found

Model of random packings of different size balls

Author: Danisch Maximilien
Jin Yuliang
Makse Hernan A.
Publication venue: 'American Physical Society (APS)'
Publication date: 06/06/2010
Field of study

We develop a model to describe the properties of random assemblies of polydisperse hard spheres. We show that the key features to describe the system are (i) the dependence between the free volume of a sphere and the various coordination numbers between the species, and (ii) the dependence of the coordination numbers with the concentration of species; quantities that are calculated analytically. The model predicts the density of random close packing and random loose packing of polydisperse systems for a given distribution of ball size and describes packings for any interparticle friction coefficient. The formalism allows to determine the optimal packing over different distributions and may help to treat packing problems of non-spherical particles which are notoriously difficult to solve.Comment: 6 pages, 6 figure

arXiv.org e-Print Archive

Crossref

Mining bipartite graphs to improve semantic pedophile activity detection

Author: Danisch Maximilien
Fournier Raphaël
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/05/2014
Field of study

International audiencePeer-to-peer (P2P) networks are popular to exchange large volumes of data through the Internet. Paedophile activity is a very important topic for our society and some works have recently attempted to gauge the extent of paedophile exchanges on P2P networks. A key issue is to obtain an efficient detection tool, which may decide if a sequence of keywords is related to the topic or not. We propose to use social network analysis in a large dataset from a P2P network to improve a state-of-the-art filter for paedophile queries. We obtain queries and thus combinations of words which are not tagged by the filter but should be. We also perform some experiments to explore if the original four categories of paedophile queries were to be found by topological measures only

Crossref

HAL-Paris 13

Mesures de proximité appliquées à la détection de communautés dans les grands graphes de terrain

Author: Danisch Maximilien
Publication venue: HAL CCSD
Publication date: 15/06/2015
Field of study

Many kinds of data can be represented as a graph (a set of nodes linked by edges). In this thesis, I show that two major problems, community detection and the measure of the proximity between two nodes have intricate connexions. Particularly, I will present a framework that, using a proximity measure, can isolate a set of nodes. Its general principle is rather straightforward and can be described as follows. Given a node of interest in a graph, the proximity of all nodes in the network to that node of interest is computed. Then, if a small set of nodes have a high proximity to the node of interest while all other have a small proximity, we can directly conclude that the small set of nodes is the community of the node of interest. I'll then show how to tweak this idea to (i) find all communities of a given node, (ii) complete a set of nodes into a community and (iii) find all overlapping communities in a network. I will validate these methods on real and synthetic network datasets.Un grand nombre de données sont représentables sous la forme d'un graphe (ensemble de nœuds liés par des liens). Dans cet exposé, je montrerai que deux problèmes majeurs concernant l'analyse de ces graphes de terrain, à savoir la détection de communautés (définies comme des groupes de nœuds qu'il est pertinent de rassembler) et la mise au point de mesures de proximité (évaluant dans quelle mesure deux nœuds sont topologiquement proches), sont fortement intriquées. En particulier, je présente une méthode qui permet, à l'aide d'une mesure de proximité, d'isoler des groupes de nœuds. Son principe général de fonctionnement est plutôt simple et peut être décrit comme suit. Étant donné un nœud d'intérêt dans le graphe, on calcule la proximité de chaque nœud dans le graphe à ce nœud d'intérêt. Ensuite, si un petit groupe de nœuds obtient une proximité très élevée à ce nœud d'intérêt et que tous les autres nœuds du graphe ont une proximité très faible, alors on peut directement conclure que le petit groupe de nœuds est "la communauté" du nœud d'intérêt. Je montre ensuite comment décliner cette idée pour résoudre efficacement les trois problèmes suivants : (i) trouver des communautés auxquelles un nœud donné appartient, (ii) compléter un ensemble de nœuds en une communauté et (iii) trouver des communautés recouvrantes dans un réseau

Thèses en Ligne

Listing k-cliques in Sparse Real-World Graphs

Author: Balalau Oana
Danisch Maximilien
Sozio Mauro
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/04/2018
Field of study

International audienceMotivated by recent studies in the data mining community which require to efficiently list all k-cliques, we revisit the iconic algorithm of Chiba and Nishizeki and develop the most efficient parallel algorithm for such a problem. Our theoretical analysis provides the best asymptotic upper bound on the running time of our algorithm for the case when the input graph is sparse. Our experimental evaluation on large real-world graphs shows that our parallel algorithm is faster than state-of-the-art algorithms, while boasting an excellent degree of parallelism. In particular, we are able to list all k-cliques (for any k) in graphs containing up to tens of millions of edges as well as all 10-cliques in graphs containing billions of edges, within a few minutes and a few hours respectively. Finally, we show how our algorithm can be employed as an effective subroutine for finding the k-clique core decomposition and an approximate k-clique densest subgraphs in very large real-world graphs

Déplier la structure communautaire d’un réseau en mesurant la proximité aux représentants de communauté

Author: Danisch Maximilien
Guillaume Jean-Loup
Le Grand Bénédicte
Publication venue: 'Lavoisier'
Publication date: 14/10/2015
Field of study

International audienceHow to find all overlapping communities in a complex network? That is, how to find all relevant groups of nodes in a linked dataset? No entirely satisfying solution to that important problem exists, having a criterion to decide which group is relevant and finding quickly these groups in large networks are bottlenecks. We found that in many networks the number of these groups is limited and that there exist, for each group, at least one node that can characterize it by itself: a node belonging only to that group and important within it. We call such a node a community representative. We develop an algorithm to find these overlapping communities. The community detection is done through measuring the proximities of all nodes to the representatives and then finding irregularities in the decrease of these values reflecting the presence of relevant groups. We show that our approach handles very large real-world networks and have comparable or even better performances compared to state of the art methods.Nous proposons un algorithme pour déplier la structure communautaire des grands graphes de terrain. L'algorithme est basé sur la détection de la communauté de chaque représentant communautaire : noeud contenu dans une seule communauté et important en son sein. Cette détection est faite avec une approche à base de mesure de proximité développée récemment. Par comparaison avec d'autres méthodes de l'état de l'art nous montrons que notre algorithme a des performances équivalentes voire meilleures et est capable de traiter les plus grands graphes de terrain

HAL-Paris1

Une approche à base de proximité pour la détection de communautés egocentrées

Author: Danisch Maximilien
Guillaume Jean-Loup
Le Grand Bénédicte
Publication venue: HAL CCSD
Publication date: 28/05/2013
Field of study

International audienceNous proposons ici une approche performante pour déplier la structure communautaire egocentrée sur un sommet d'un gaphe. Nous montrons que, bien que chaque sommet d'un réseau appartienne en général à plusieurs communautés, il est souvent possible d'identifier une communauté unique si l'on considère deux sommets bien choisis. La méthodologie que nous proposons repose sur cette notion de communauté multi-egocentrée ainsi que sur l'utilisation d'une mesure de proximité dérivée de techniques de dynamique d'opinion, la carryover opinion. Cette approche pallie les limites des fonctions de qualité traditionnellement utilisées pour la détection de communautés egocentrées, et consiste à étudier les irrégularités dans la décroissance de cette mesure de proximité

HAL-Paris1

Multi-ego-centered communities in practice

Author: Danisch Maximilien
Guillaume Jean-Loup
Le Grand Bénédicte
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2014
Field of study

International audienceWe propose here a framework to unfold the ego-centered community structure of a given node in a network. The framework is not based on the optimization of a quality function, but on the study of the irregularity of the decrease of a proximity measure. It is a practical use of the notion of multi-ego-centered community and we validate the pertinence of the approach on benchmarks and a real-world network of wikipedia pages

HAL-Paris1

Learning a proximity measure to complete a community

Author: Danisch Maximilien
Guillaume Jean-Loup
Le Grand Bénédicte
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/10/2014
Field of study

International audienceIn large-scale online complex networks (Wikipedia, Facebook, Twitter, etc.) finding nodes related to a specific topic is a strategic research subject. This article focuses on two central notions in this context: communities (groups of highly connected nodes) and proximity measures (indicating whether nodes are topologically close). We propose a parametrized proximity measure which, given a set of nodes belonging to a community, learns the optimal parameters and identifies the other nodes of this community, called multi-ego-centered community as it is centered on a set of nodes. We validate our results on a large dataset of categorized Wikipedia pages and on benchmarks, we also show that our approach performs better than existing ones. Our main contributions are (i) a new ergonomic parametrized proximity measure, (ii) the automatic tuning of the proximity's parameters and (iii) the unsupervised detection of community boundaries

HAL-Paris1

Calculation of the Voronoi boundary for lens-shaped particles and spherocylinders

Author: Adrian Baule
Hernán A Makse
Hoff K
Louis Portal
Makse H A
Maximilien Danisch
Romain Mari
Publication venue: 'IOP Publishing'
Publication date: 01/11/2013
Field of study

We have recently developed a mean-field theory to estimate the packing fraction of non-spherical particles [A. Baule et al., Nature Commun. (2013)]. The central quantity in this framework is the Voronoi excluded volume, which generalizes the standard hard-core excluded volume appearing in Onsager's theory. The Voronoi excluded volume is defined from an exclusion condition for the Voronoi boundary between two particles, which is usually not tractable analytically. Here, we show how the technical difficulties in calculating the Voronoi boundary can be overcome for lens-shaped particles and spherocylinders, two standard prolate and oblate shapes with rotational symmetry. By decomposing these shapes into unions and intersections of spheres analytical expressions can be obtained.Comment: 19 pages, 8 figure

arXiv.org e-Print Archive

Crossref

Queen Mary Research Online

Proximity measure applied to community detection in complex networks

Author: Danisch Maximilien
Publication venue
Publication date: 15/06/2015
Field of study

Un grand nombre de données sont représentables sous la forme d'un graphe (ensemble de nœuds liés par des liens). Dans cet exposé, je montrerai que deux problèmes majeurs concernant l'analyse de ces graphes de terrain, à savoir la détection de communautés (définies comme des groupes de nœuds qu'il est pertinent de rassembler) et la mise au point de mesures de proximité (évaluant dans quelle mesure deux nœuds sont topologiquement proches), sont fortement intriquées. En particulier, je présente une méthode qui permet, à l'aide d'une mesure de proximité, d'isoler des groupes de nœuds. Son principe général de fonctionnement est plutôt simple et peut être décrit comme suit. Étant donné un nœud d'intérêt dans le graphe, on calcule la proximité de chaque nœud dans le graphe à ce nœud d'intérêt. Ensuite, si un petit groupe de nœuds obtient une proximité très élevée à ce nœud d'intérêt et que tous les autres nœuds du graphe ont une proximité très faible, alors on peut directement conclure que le petit groupe de nœuds est "la communauté" du nœud d'intérêt. Je montre ensuite comment décliner cette idée pour résoudre efficacement les trois problèmes suivants : (i) trouver des communautés auxquelles un nœud donné appartient, (ii) compléter un ensemble de nœuds en une communauté et (iii) trouver des communautés recouvrantes dans un réseau.Many kinds of data can be represented as a graph (a set of nodes linked by edges). In this thesis, I show that two major problems, community detection and the measure of the proximity between two nodes have intricate connexions. Particularly, I will present a framework that, using a proximity measure, can isolate a set of nodes. Its general principle is rather straightforward and can be described as follows. Given a node of interest in a graph, the proximity of all nodes in the network to that node of interest is computed. Then, if a small set of nodes have a high proximity to the node of interest while all other have a small proximity, we can directly conclude that the small set of nodes is the community of the node of interest. I'll then show how to tweak this idea to (i) find all communities of a given node, (ii) complete a set of nodes into a community and (iii) find all overlapping communities in a network. I will validate these methods on real and synthetic network datasets

Theses.fr